NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Online scalable Gaussian processes with conformal prediction for guaranteed coverage," Proc. of Intl. Conf. on Acoust., Speech, and Signal Processing, Hyderabad, India, April 6-11, 2025.

Xu, J; Lu, Q; Giannakis, GB (April 2025, IEEE)

Full Text Available
Learning to (Learn at Test Time): {RNN}s with Expressive Hidden States

Sun, Y; Li, X; Dalal, K; Xu, J; Vikram, A; Zhang, G; Dubois, Y; Chen, X; Wang, X; Koyejo, S; et al (July 2025, International Conference on Machine Learning)

Full Text Available
DREAMDISTRIBUTION: LEARNING PROMPT DISTRIBUTION FOR DIVERSE IN-DISTRIBUTION GENERATION

Zhao, BN; Xiao, Y; Xu, J; Jiang, X; Yang, Y; Li, D; Itti, L; Vineet, V; Ge, Y (April 2025, ICLR 2025)

The popularization of Text-to-Image (T2I) diffusion models enables the genera- tion of high-quality images from text descriptions. However, generating diverse customized images with reference visual attributes remains challenging. This work focuses on personalizing T2I diffusion models at a more abstract concept or category level, adapting commonalities from a set of reference images while creating new instances with sufficient variations. We introduce a solution that al- lows a pretrained T2I diffusion model to learn a set of soft prompts, enabling the generation of novel images by sampling prompts from the learned distribution. These prompts offer text-guided editing capabilities and additional flexibility in controlling variation and mixing between multiple distributions. We also show the adaptability of the learned prompt distribution to other tasks, such as text- to-3D. Finally we demonstrate effectiveness of our approach through quantitative analysis including automatic evaluation and human assessment.
more » « less
Full Text Available
Graph Diffusion Transformer for Multi-Conditional Molecular Generation

Liu, G; Xu, J; Luo, T; Jiang, M (December 2024, Conference on Neural Information Processing Systems)

Full Text Available
Developing Soil Internal Erosion Indicator to Quantify Soil Internal Erosion around Defective Buried Pipes under Water Exfiltration

https://doi.org/10.1520/GTJ20240122

Abegaz, R; Wang, F; Xu, J; Pierce, T; Moody, M (January 2025, Geotechnical Testing Journal)

Full Text Available
LLM-TA: An LLM-Enhanced Thematic Analysis Pipeline for Transcripts from Parents of Children with Congenital Heart Disease

Raza, M Z; Xu, J; Lim, T; Boddy, L; Mery, C M; Well, A; Ding, Y (February 2025, https://doi.org/10.48550/arXiv.2502.01620)

Thematic Analysis (TA) is a fundamental method in healthcare research for analyzing transcript data, but it is resource-intensive and difficult to scale for large, complex datasets. This study investigates the potential of large language models (LLMs) to augment the inductive TA process in high-stakes healthcare settings. Focusing on interview transcripts from parents of children with Anomalous Aortic Origin of a Coronary Artery (AAOCA), a rare congenital heart disease, we propose an LLM-Enhanced Thematic Analysis (LLM-TA) pipeline. Our pipeline integrates an affordable state-of-the-art LLM (GPT-4o mini), LangChain, and prompt engineering with chunking techniques to analyze nine detailed transcripts following the inductive TA framework. We evaluate the LLM-generated themes against human-generated results using thematic similarity metrics, LLM-assisted assessments, and expert reviews. Results demonstrate that our pipeline outperforms existing LLM-assisted TA methods significantly. While the pipeline alone has not yet reached human-level quality in inductive TA, it shows great potential to improve scalability, efficiency, and accuracy while reducing analyst workload when working collaboratively with domain experts. We provide practical recommendations for incorporating LLMs into high-stakes TA workflows and emphasize the importance of close collaboration with domain experts to address challenges related to real-world applicability and dataset complexity.
more » « less
Full Text Available
Demystifying the Communication Characteristics for Distributed Transformer Models

Anthony, Q; Michalowicz, B; Hatef, J; Xu, J; Abduljabbar, M; Shafi, A; Subramoni, H; Panda, D (August 2024, Hot Interconnect 2024)

Full Text Available
Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering

Naeem, A; Li, T; Liao, H_R; Xu, J; Mathew, A M; Zhu, Z; Tan, Z; Jaiswal, A K; Salibian, R A; Hu, Z; et al (November 2024, https://doi.org/10.48550/arXiv.2411.17073)

Accurate diagnosis and prognosis assisted by pathology images are essential for cancer treatment selection and planning. Despite the recent trend of adopting deep-learning approaches for analyzing complex pathology images, they fall short as they often overlook the domain-expert understanding of tissue structure and cell composition. In this work, we focus on a challenging Open-ended Pathology VQA (PathVQA-Open) task and propose a novel framework named Path-RAG, which leverages HistoCartography to retrieve relevant domain knowledge from pathology images and significantly improves performance on PathVQA-Open. Admitting the complexity of pathology image analysis, Path-RAG adopts a human-centered AI approach by retrieving domain knowledge using HistoCartography to select the relevant patches from pathology images. Our experiments suggest that domain guidance can significantly boost the accuracy of LLaVA-Med from 38% to 47%, with a notable gain of 28% for H&E-stained pathology images in the PathVQA-Open dataset. For longer-form question and answer pairs, our model consistently achieves significant improvements of 32.5% in ARCH-Open PubMed and 30.6% in ARCH-Open Books on H\&E images.
more » « less
Full Text Available
Inferring HIV transmission patterns from viral deep-sequence data via latent typed point processes

Bu, F; Galiwango, R; Grabowski, K; Ratmann, O; Xu, J (February 2024, Biometrics)

Full Text Available
SE(3) Equivariant Convolution and Transformer in Ray Space

Y. Xu, J. Lei (September 2023, Thirty-seventh Conference on Neural Information Processing Systems)

Full Text Available

« Prev Next »

Search for: All records